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ftrtitlciall tntctligence, Language and the Studyol Knowledge 

Artificial Intelligence (AJ) represents a new style of thinking about cognition that is having an 
important impact on the stud/ of language. Tht goats of the essay are; 

■ to convey a sense of tht intellectual personality of Artificial Intelligence, and 

- to identify some trends in its development which we see as especially relevant to she study 
of language and. of the role of an A! approach to language in education. 


1* Artificial Inteilia&ncii aa tiie Study qI TnlreHifleni Fr&ceases 

Before we embark on the substance of this essay. »f is worthwhile to clarify a potential source 
□f confusion. For many, Al is identified as a narrowly focussed field directed towards the goal of 
programing computers in such a fashion that they acquire the appearance of intelligence. Thus it 
may seem paradoxical that researchers in the field have anything to say about the structure of 
human language or related issues in education. However, the above description u misleading. It 
correctly delineates the major methodology of the science, that is, the use of computers to build 
precise models of cognitive theories. But it mistakenly identifies this as the only purpose of the 
field. Although there ii much practical good shat can come of more intelligent machines, the 
fundamental theoretical goal of the discipline is understanding intelligent processes independent of 
their particular physical reaSuiTjon. 
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Naturally, understanding intelligence abstractly hi* intimate interactions with understanding' It 
in its natural form. Thus, AT and psychology have dose tics. Indeed, At has been called 
■theoretical psychology" (Newell, 39^5), AT djaws from psychology a set of basic concerns — 
understanding language, perception, memory, problem solving. Psychology, in turn, acquires a new 
framework of computational ideas Tot ex pressing cognitive theories. Indeed, a major purpose of 
psychology becomes the discovery not of the class of programs that could possibly explain a given 
intelligent behavior (this is the archetypical concern, of ,M), but rather the particular program that 
a given Individual actually possesses. 


1.1 A Knowledge- O riented rather than PowersQrienled Theory of Intellig e nce 

i- 

Given the general perception df AE as the computational study of intelLigence, it still remains 
10 identify the particular set of questions and theoretical constructs that a researcher in the field 
brings to bear an cognitive problems. En the earliest years of the field [Le. during the late fifties 
and early Sixties), the hope was that a Tew .simple but Vrry powerful techniques could be identified 
that together could be used to create intelligent program*. During this period, theorem proving 
and search played a primary nok. Indeed, to many Al has become identified a? the science of 
heuristic search,. that Ji, the stody of procedural techniques tor exploring state spaces too large CO be 
explored :n an exhaustive fashion. During these early year?, an A] paper on language might have 
begun by representing the state Space by means of a formal grammar and then have focussed on 
the issue of achieving successful parsings in reasonable times. (Green, 3S63; Lindsay, 1965). 


Today there has been a shift in paradigm. The fundamental problem of understanding 
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intelligence li norths idem ifKaUon of a few powerful techniques, buf rather the question of how 
TO represent large Of knowledge T?i * fuhim that permits Ihetr effective me Onti interaction. 

This ihift ii based on a decade of exprnenoe with programs that relied on uniform search or 
logistic techniques that proved [O be hopelessly inefficient when faced with complex problems 
embedded in large knowledge ipacei, The current point of view is that the problem solver 
{whether man or machine) must know explicitly how to use ns knowledge -- with general 
techniques supplemented by domain specific pragmaUC knowhow. Thus, we ie* AI as having 
shifted from a poatr-bastii strategy few achieving intelligence to a k-ntwlcdge-baierf approach, 
Minsky and Paper: characterize these two points of view as follows: 

The PQUEP strategy seeks * generalized increase in 
ccrf.pL.ta t i ona poker, ]t mag look toward rvcu kinds of computers 
('parallel' or "fuzzy' or "easoc i a t i *e m or whatever I or H may 
■ pok toward e« tens ions of deduct i ve genera I 1ty, or information 
retrieval, or search algor I this* - things like better Vsso I Ut i on" 
methods* bettor net hod a for exploring trees ar.d nets, hash-Coded 
triplets, etc. Ih each case Ch& \.mprovanent sought is intended 
to be "yniforn« H — i ndepondent of the particular data oase. 

The KNPULEOGE strategy seas progress as cowing true Patter 
uaye to axpreaa, recognize, and use divarsa and particular forms 
of knowledge. This theory sees the prohleii as epi ste«otogicai 
rather than as a natter of computational peuer or mathematical 
g en e r a I i t y , It suppo Sc i, for ex amp I e. that uhpn a scientist 
SoivES a neu problem, he engages a highly organized structure of 
especially apprupriate facta, bade!?, analogies, planning 
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tneiheril$r*s, $eLf-disc Ip Iing proceciunis, etc. To be sure, he a I an 
engages 1 genera proo I en-SO I v mg scheiidia Put it is by ho niefing 
Obvious that very emart people are that u£y directly because of 
the Superior pouar o I their general, bet hods — ae cotnp^red ul th 
average people. Indirectly, perhaps,. put that jg another matter; 
a very Intelligent person might be that nag because of specific 
Jbcal features Of his k noth edge-organ 1 j i r.g knowledge rather than: 
-nE-aiiae of glooal qualities of hts thinking" p|~,ch, ewceoi for 
the effects of hi* seif-applied knowledge, mi^ht be little 
different frcn a child's. 

(Rinaj(.y r F'abert, 1974+ p. S3] 


It is worthwhile to observe here chat the goals of a know ledge-based approach to AI ar* 
closely Itin to those which motivated Piaget to call bll research center "Centre dTpistemologie 
Genetique" or, more precisely, to his reason for calling himself an "ep istemolpg i sf father than a 
psychologist, < he commori theme n the view that the process of intelligence is determined by the 
Icnowkdge head by the subject. The deep and primary questions, are to understand the operations 
ifid data structures involved. The physical (biological nr electrical) mechanism* underlying them 
are not seer by Us or by Piaget as the SOurCc of intelligence in any structural sense (although 
intelligence could not happen without some such EmbodimeriE^ 

Perhaps the knowledge paradigm li most clearly chariccsriied by a developmental perspective, 
Naturally, intelligence is impossible without a cenain basic ‘’hardware" capability. But bur position 
and PLaget's u that this capability is already present iti the very young child. The subsequent 
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development of intelligence is not due to the emergence of new biological capabilities in ihe 
individual. Rather* increased performance is due to the acquisition of knowledge. Thu knowledge 
can range from specific iacts about a domain to general problem solving schemata. Indeed* IE may 
even be knowledge about how to learn (for example, facts about debugging techniques). Some 
facts result in only local improvements in the individual's capabilities, while others may have 
global effects, giving rise fg a stage transition in PLagctian terms. From Lht computer standpoint* 
we see che need for some improvements in speed and memory capabilities, especially For such 
special purpose applications as vision and speech. But the fundamental difficulties facing 
researchers in the :icld today aie not limitations due to hardwire, but rather questions about how 
to represent large amounts of knowledge in ways that still allow the effective use of individual 
facts. 


L£ Artificial Intelligence as a P rocedural The o ry of Knowledge 

The reader may perhaps wonder at this point how Al is to have an identity of its own if ihe 
knowledge strategy is pursued, i.e if it asks only for explicit representations of the knowledge of 
particular domains, Iin L this simply the concern of the Specialized field of study associated with 
that domain? How does this notion of Al, for example, distinguish it from linguistics when the 
domain is that of language competence!' 

The answer *tes in identifying two concerns of A! which are uniquely prctfrufuraf in nature. 
The first is that the knowledge of a particular domain must be represented in a farm that is 
useable. A linguist may content himself wmh the specification of some attribute of grammatical 
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knowledge by defining the appropriate transforimtiMi: an A] rescarther must go nr* step further 
by asking how this transformation can be woven into the total web Of Jingoistic information in 
such a way that the comprehension or gen era non process n possible in reasonable computational 
times. Thus* AI brings to the traditional study of a given field the quest for Identifying and 
formalizing prvf matte knowledge, namely knowledge of 'how' traditional facts are to be used. 

There is a second concern which distinguishes A3 from traditional studies of a given domain. 
This is its concern for identifying the manner in which competence in different domains interact. 
Again drawing upon an example from the study of language, a grammarian can (egnmately fotos 
on attributes of gramma: divorced from the semantics or us? of language. The essentia! concern 
of AI, however, is to conduce frameworks in which all of these divers* kinds of knowledge can 
successfully Interact in order to allow the complete comprehension process to take place. Thus, both 

concerns which make Al unique arise from its responsibility to structure knowledge into procedural 
systems that can actually solve problems. 

I • 

L_3 Organization of this Paper 

+ 

Our goal in the remainder of this paper Will be to provide the reader with insight into the 
structure of theories currently being formulated in Al within this knowledge-based paradigm, 
Sections 2 and 3 illustrate the Al approach, by discussing a particular ex ample - frame theory - 
winch many people in the field are currently exploring and find quite ex tiring. Section 2 
introduces the subject by approaching frame theory as an Al extension of the traditional linguistic 
notion of case grammars from a narrowly syntactic theory to a general representation for 
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knowledge- Seaton 3 discusses (He various. attribute? of Al theories that frame? exemplify. In 
lection? 4 and 5, we deepen our vlsw of a< as a computational style of epL&temolQgy by -discusiing 
two aspects of cognition and language to which Al has made unitgur contribution? - the rok of 
self-knowledge ^nd the importance of control structure. Section 6 concludes our theoretical 
discussion by briefly viewing in perspective the intellectual history of the Held 

These topics do nw by any means exhaust The contribution of AI to the Study of Language. 
But w-e believe they provide an excellent illustration of rhe Al methodology. And (hat Seems Id us 
the only serious purpose to be served by a paper of this length that is atosssable both to student? 
of the field and to non-spectallies. Two companion essays tu this paper were prepared for the 
National Institute of Education by Winpgrad (1&71&) and Wilks (E976J and complement the 
discussion presented here. 

Throughout the paper, we avoid extensive excursion? into formalism and jargon- More 
seriously, we also ignore, or pass over in a cursory way, i variety of serious technical issues. We do 
this In order to focus atjeniion on the overall enterprise of a computational investigation of 
knowledge. We feel especially justified because of the tendency in the literature to focus on 
narrow tech Leal issues, Nevertheless, to compensate for this broad focus, we provide 1 in section 6.2 
a list of the technical questions which we feel are critical for the reader who wishes to pursue this 
paint of view more d reply. Also, the reader should rewgniie that the various research we paint to 
as illustrating our position have many important differences However, we feel That the critical 
need at this time is to develop ^ feeling for the paradigm which characternes AI as a method for 
Investigating knowledge and language, and it is to this goal that we direct our efforts. Finally, we 
and, Others in the Al community expect to develop further the ideas raised tn this paper. The 
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interested reader it invited Ed write us for references to the latest research in LhLS area. 


1.4 Educational Applications 

The final part of the essay (section 7) will touch schematically on some of the consequences for 
education of Artificial intelligence research on natural language. These consequences are of very 
different sorts. Some of Lhcm, no doubt for mou people the most obvious, are consequence* of 
actually possessing machines capable of understanding what a student says, Obviously such 
machines would greatly improve the quality of ‘Computer Aided Instruction" {in the sense of 
tutorial programs, programmed instruction administered by cgmpuTers, "drill and practice". 
Computerited advice, etc:) 

Mdch deeper consequences, however, art related to the intellectual content of Artificial 
Intelligence research on natural language. As a new kind of cognitive theory it has obvious 
implications for the education. It is our personal opinion that these consequences will very deeply 
transform the theory and practice of education. Although w c will not try to argue for this position 
a; length, we shah try to convey the flavor of the perceptions on which It LS based. 
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^“ a » 1C5 a Umlyinq Ftp-pr-u-ach >o Reprcaenhrig ft now Led ge 

Frame Theory ii m Al approach to representing knowledge (hai genera lien the linguistic 
rtotion of case grammars in a manner that allows, the representation not only of syntactic 
knowledge, hut also procedural knowledge for how to construe a parse as well as related semantic, 
contextual and thematic facts. Historically, frame theory is based upon more than purely linguistic 
considerations* In the original formulation by Minsky {1975). the Actions of frames and frame 
systems were developed in equal measure from problems arising from the construction of adequate 
theories for vision, for memory, for logical reasoning and for the comprehension of language. 
However, we shill narrow our focus in this section to consider the utility of frames solely in 
relation to linguistic knowledge. 3n section 3, we step back from the realm of language to consider 
a ‘variety of .arger issues that characterize the AJ approach to representing knowledge. 

2-1 Case Grammars 

The case frame approach to linguistics as developed by Fillmore (1965) and others constrains 
the enquiry to the realm of syntactic knowledge. Within this sphere, the basic ids* ts to organize 
structural knowledge of the sentence around the verb, The result is a cait grammar which 
specifies for each verb the cases which it requires. To illustrate this, we use The traditional 
linguistic device of choosing a representative sentence around which to organize our observations: 

(51} John tickled the girl with the feather. 

Following Chare’s formulation, a case-oriented approach would represent SI as follows: 

Tick let agent John 


patient 


the girl 
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instrument a feather 


2.2 Understanding a Sentence 

To make the linguistic enterprise tracubie* [he linguist tan ignore [he issue of understanding 
an Utterance beyond the narrow concern of identifying its syntactic structure. But A1 is forced to 
addreii this larger issue if computational models are to he designed that actually act appropriately 
in response to the utterance* Thus* AI is witling to consider the senses of understanding SI 
lllusiralrd by the following three listeners: 

A * ive year old child, who has a clear linage of John using the feather to 
evoke giggles, 

A slightly more sophisticated listener capable of taking account of a context 
such as; *Atl the girls in the room had strange hats. One was decorated with a 
large feather* another with a towering confection of artificial fruit, JoJrn 
ttckhd the g/rf cuJTA fit fttfhtT* 

A Eihguist who understands the sentence as an example of a wet] known 
ambiguity ;n parsing. 

It IS obvious Lhat all the subjects use some grammatical knowledge. For example the order of 
the words tells them that it was John who did the tickling and not the girl. But grammatical 
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inowledg. alone is not sufficient ever, to account for the sfnuctK parsing of the sentence, «. the 
interpretation of the phrjse 'will, the feather' at adjectival or adverbial. This point is made clear 
bp contrasting the sentence with 52 - 

John tickled the girl with the funny nose. 

Even our five year old would not be likely to read thu as if John used the nose ai a tickling 
inirrument. The difference h not puiilmg to common sense: after ail the child knows that 
seatherj are good for tilling while noses arc more likely to be used for smelling or as person at 
identifiers, Bui the difference should trouble the cognitive theorist who knows how to describe 
syntactic Structures id fine detail but has no better description than naive common sense of the 
child's knowledge about feathers and noses. And the difference is relentlessly present for the 
computer-oriented, theorist who absolutely needs to find ways to formalize it. 


g^gJTh e Procedural Idea of a Frante-Keeper 

Let US begin our generalization of case grammars by discussing informally how a procedural 
case theory might be organised. Following this, we w,LL discuss work by Riesbcck ( 1974 ) chat 
represents one formal realization of this point of view. 

In our informal mini-theory there is associated w L th each important word in the sentence an 
active agent which is smcrly defined as a computational procedure but Which we might just as well 
anthropomorphic {for the moment anyway) as a person, Let’s call, them F re mt~K**p e rj for 
reasons which w,]] WDn become apparent. The process of undemanding the sentence will consist 
of an interaction between these Frame-Keepers wherein each tries to fulfill certain needs. For 
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example the needs of the TlCKLE’FRAME keeper include; 


Find Che agenT (the person responsible for the tickling)! 

Find the patient (the person being nek led)! 

Find the instrument (the means by which the agent is tickling the patient)! 

So the TICKLE-FRAME keeper is seen a* actively seeking Co fill these slots written into his 
definition. If all or them can do so to their satisfaction (an idea which we wifi pursue shortly) the 
sentence is to he considered as parsed 

The essence of the AI extension of the case frame idea is Chat the frame for the verb contain 
not only syntactic information a bom the expected location Of the underlying Cases, e.g, \n a 
declarative sentence :hat the subject is the agent, the object the patiem arid the 'wjrhT prepositional 
phrase the instrument, but also semantic and pragmatic facts. Thus, we imagine that for our 
tickle-frame, the Frame-keeper has access to world knowledge that nates that feathers are Typical 
tickling instruments, while other kinds of objects, for example, dumbbells, are not. Hence, it would 
be the ttikie-frame Utp*r that would be in a position to reject for 52 that the Vth H phrase refers 
to the tickle instrument because it knows that one generally does not tickle with notes. 

In section 2.4, we discuss an existing program that associates both syntactic and semantic 
knowledge with individual words in order to comprehend the sentence. In section 3.5, we indicate 
how the frame idea can be extended to nouns. Given that extension, the process of 
comprehending 52 becomes clearer as not only does the tickle frame reject the nose phrase, but the 
gin frame accepts it as * legitimate descriptor Of people Thus, the jystem has two imirce? of 
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evidence both supporting one another regaling ?he intended parse, 


2.4 A Formal Realizatio n gf the Frame Idea 


Our discussion of the T1CKLETRAME and frame-keepers represents the kind of concern 
for active kwuiedge That AI brings to linguistic Studies, G. Riesbcck (LS“d, 1975) has designed and 
implemented a program that, although not frame-based, shares marry important properties wuh Llie 


approach outlined above, Ries beck’s parser maps natural language mto the Conceptual 
Dependency structures designed by $chan* (3973). S3 is typical of the sentences that Riesbrch's 
system analyzes: , 


John gave Mary a beating 

Procedural in formation js associated wjih each word in the sentence, which js triggered when the 
parser -encounters the word in its Is: r-; 0 ‘Tight analysis. This information constructs an instantiated 
rase-frame-like Structure. It also contains expectations that serve to disambiguate multiple umrd 
ierues and properly decode the syntactic cues that indicate the fillers for each slot m the case-frame 
associated with the Verb. A trace of the analysis of the parser is given in figure 3 r 


The system, called MARGIE, of which RLesbeek’s parser was a part, exhibited its 
understanding by generating paraphrases of the original sentence and by making deductions based 
on it (Schankj J972E The basic operation was to comprehend the sentence by mapping it into the 
conceptual dependency representation developed oy Sehank (1973), Figure £ illustrates, the 
conceptual dependency diagram generated for the above sentence. The details of Schank’s ' 
representation are not important here, except to note that MARGIE understood the sentence as 
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STEP W-'RL READ 


fifing 


Jehu 


t gave 


3 Hery 


beating 


period 


nune 


KI QUESTS tal"I MG 


1-1k there an 
NF? 


1-Ifc there an 
NP7 


S“is the current 
NF a hunan? 

J-ie thfc curretlE 
HP an object!' 

4* Is the current 
liT an sctlPUT 
5-true 


RE-QUFqTS TRIGCE-tLIi 


?-is th® current 

HP a human? 

3— Is eTi-fr curr-cnc 
HP an object? 

4- t» the current 
>T an ectldm? 


j-ia the current 
N7 on object? 
4-is the current 
HP an atxien? 
*-true 


7-does the cur¬ 
rent word end. 
■an HP? 


7-does the cur¬ 
rent word end on 
KP7 


■none 


3“ 1-9 the current 
HP an object? 
4-lc the current 
hT an octlirtt? 


ACTIONS TAKZti 


“I 


a.Hsirno " ! joIiii lr £= 
subject of the verb 
to follow 


SSi-Juine the word 
''iti" if It appears, 
liitrodue^a the re¬ 
cipient of die 
"giving" 


•as-suno Mg fjr ie the 

recipient of the 

"siitas" 




_ 


save trie current 
list of requests ana 
replace it with; 
7-dneg the current 
word eiid an HP? 


build the NP n a 
heatins -1 and reset 
the li&t of recueste 


assume the El? action 
is the <ha.In action 
bf the clause, the 
Subject {John} is 
the actor and the 
recipient (Mary) i* 
the Object 


Figure ]_ 

Trace the Hie&becfc Parser <pr! ’’John gave FLery a beating.' 


From (E-i-egbeck 1974. P. 31) 
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Conceptual Dependency Jkcpraiien tit Lm of file *e=ning of the sentence 

gave hary a beating.™ 


I 


Ft OC 1 (Hiecheck 1974. ?. 10ft) 
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referring to a series of actions by John involving the motion of his hand Riesbeck's parser has 
not erroneously interpreted "give" to mean that John is trawfenng the possession of something to 
Mary. 


2,5 Frames for HapresentinE Knowledge a bout Nouns 

i he case frame idea in its original linguistic form was oriented Awards representing; 
knowledge about verbs, AI pursuing the goal of comprehensive theories of knowledge ha; 
generalized the idea to allow frames for nouns. 


Consider again the sentence "John tickled the girl with the feather." We imagine that pan of 
the knowledge for properly understanding ihis sentence is that the verb-frame for tickle has as a 
typical mstrument feathers* Such verb-centered knowledge i; supplemented by frames fur such 
nouns as girl Which contain pointers to sub-frames which describe typical appearances, purposes, 
activities, etc. The appearance sub-frames would Indicate that girls do nut 'have" feathers. They 
might under closer scrutiny indicate that 'feathers' are a possible but unlikely kind of dress. 
However, the strong default for "feathers* as a typical titklmg instrument would predominate. 

A related, example shows where tvewn-frame information might dominate the verb-frame 
default, Cohsider the three sentences; 

■S-i) [ frightened the dng with the loud noise 
(S5) I frightened the dog with the brown fur. 
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{£6} I frightened the dog With the loud hark. 

Wr believe most readers would understand Si by assuming (has, the "loud noise" was (he 
instrument of frighten . This would be because of both the verb-fra me knowledge that laud 
noiJ.cs -often can be used to frighten and the noun-frame knowledge associated with dogs that 
indicate that "noise" U not "possessed* by Shcm. Or. the orher hand, in 35. the noun-frame 
knowledge for dogs would surely indicate that they typically have "f yr H , hence the phrase "with the 
brown fur' would be understood to be modifying the noun "dog". In £6, we suspect that our 
population of readers would be divided, depending on whether Lheir frame for "frightening* 
contained the typical instrument of emitting a toud noise, describable as a bark, with a stronger 
emphasis than their frame for dogs contained The default property of emiTCIhg noise by barking. 
(This- extension of frames to nouns. has not yet reached the point of development in the form of 
programs. However, a recent thesis by C. Drake (1975) ex tend 5 - the details of the theory beyond the 
outline provided by Minsky to a form that is closer to the Implementation state,) 

In cbe previous paragraph, we wrote as if the tickle frame could have ^frrex, es^teff 
knowledge about feathers. A fundamental and pervasive class of problems concerns the relative 
feasibilty o: achieving this as cgcupared to various schemes for Xndt.rr.cr reference to feathers via 
same super-sn or dejcrfptUm. For example, the tickle frame might know that "long, soft objects* 
ftre good instruments for tickling and somehow use [his as a means of finding a connection to 
leathers. Most early workers Jfi A1 (like most comempOrary psychologists) were strongly averse to 
the direct approach and sa a great deal of attention has been given to the invention of 'search 1 ' ox 
information retrieval methods to mediate the indirect approach. Current thinking (especially 
amongst f rame^heorisis) Jeans towards very much more direct knowledge Lhan was previously 
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considered feasible. Thu places a htavj demand on fnmary, and less on retrieval mechanisms. 
This appears [0 some to be heavy-handed". But we do know that human memory has a very 
large capacity and no one has been able [0 propose very sophiScared retrieval mechanisms. So 
perhaps it is nature (rather than the theorists) who has adopted :he "heavy handed"' solution. 


2-E Extendi he the Frame idea to a Theory of Context 


Procedural frames which are not restricted to verts are powerful tools for dealing with the jet 
of pmb.Ems traditionally called "context", indeed, once we break away from the verb-centered 
sentence as the unit for analysis, it becomes possible to talk about "scope" in 4 possibly very large 
text. 


This too is a topic oF current research and there are few programs implemented aj yef. One 
of the systems that does exist, however, IS Schank's successor to the MARGIE system that generates 
paraphrases of simple stories {Schank, 1975). The essentia] theoretical idea in the new system is the 
existence of frames (called scripts by SchaniO that describe typICAl ACtLvilies in terms of his 
Conceptual Dependency primitives. An example is the following informal sketch of a frame far 
going to a restaurant; 


Scripfj restaurant. 

Rd ei au3 1 ort, uaitreas. chef, eas-hie.^ 

Re as Obi to get food So as tc go down in burger Ond up in pleasure. 

Scene I Entering 

P7FANS - go in!* restaurant 
MBUILD w find table 
PTflANS - go to taale 
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HOVE - git down 

Scene 2 Ordering 

ATRANS - recerve menu 
ATTEND - look at it 
HBUlLB - decide on order 
nTRANS - toll order to m 3itress 

+■ 

Scene 3 Eating 

ATRfiNS - receive toed 

1 NDEST - oat food 

Scons 4 Exiting 

H7RANS - ask for cheek 
- ATRaNS - give tip to waitress 

PTrans - go to cashier 

ATRANS ” give mcney to cashier 
PTRANS - go out of restaurant 

(Schank, 1075. p. 117) 


The image of understanding a story is mapping the sentences of the story into the act Lons 
described Ln the frame, Unstated facts described Ln the frame ire assumed. Thm, mapping a 
story in which ;be sentence appeared: 'John left a tip" into this frame would allow the system ro 
know that the “tip" was left for the waitress.! fact contained in the frame but not in the story, 

2,7 Frames for Thematic Knowledge 


Our previous examples have discussed extensions of th? case frame idea to incorporate into 
She frame a larger fTlCtiOTi of the knowledge associated wnh comprehending a sentence than is 
Included within the purely grammatical realm. We would like to conclude by extending our 
representation beyond the sentential level to include the kinds of knowledge that link semenccs 
together, i HiS represents the frontier o; current research and so we will have few formal examples 
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and have tp con tent ourselves with seme observa tio-ni wh^h indicate the direction in which we 
expect this research to go. 

To develop tKa5 notion of thematic frames, let ui consider the problem of undemanding a 
fairy taSe, Our goal (5 to represent the knowledge that links .sentences, in a si or)' together mta a 
coherent whole. To begin, we will first examine so™ work by Rumelharc {197i) which addresses 
this question, but utilizes a phrase-structured grammar (augmented by semantic interpretation rules)* 
to represent. Che structure of The story. TypjcaL of the rules that his system possesses are: 

RULE L: STORY SETTING + EPISODE. 

The setting is a statement of the time and place of the story as wed as an introduction for the main 

characters, e g. Once upon a lime, in a far away land, there lived a good king. - 

RULE E: SETTING -> (STATE VES ) 5 

Episodes conidSI of events followed by reactions 

RULE 3; EPISODE -> EVENT * REACTION 

An event is the itujjt general category of the grammar. 

RULE d: EVENT -> EPISODE or CHANCE-OF-STATE or ACTION 

or EVENT*EVENT 

A reaction consists of an interna] and an overt response: 

RULE 5: REACTION -> INTERNAL-RESPONSE * OVERT-RESPONSE. 

There are a large number of in tern a 3-responses. Two art 

RULE INTERNAL-RESPONSE EMOTION or DESIRE. 

Overt-responses can be elthei an action or a series of attempts: 

RULE 7: OVERT-RESPONSE ACTION or (ATTEMPT) 5 
Attempts also have an internal itruceurr. 
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RULE 9: ATTEMPT PLAN * APPLICATION. 

The plan Ji developed to accomplish a desire. The application Li the actual execution or" the plan. 

Far each of these rules, there are semantic interpretation rules that mditaLe for example that 
rule 3 mean? that both che setting and the episode must be present while rule 3 (with she same 
syntactic V connective} indicates that the event is the cause of the reaction and precedes the 
reaction in Ume- 

Lei us now a?k how the skeleton represented by this story grammar might be ex tended, imo a 
frame system. The baste change is the addition of large amounts of highly structured And 
particular knowledge. This information which w# shall call dtfavit knowltdgi includes 
information about the various aspect? of the story, e.g. setting, episodes, internal anti external 
responses. Indeed, the proposal is that the system knows, a few "favorite' stones in extensive detail. 
Thus, with Only a brief introduction to the story such as "Once upon a Lime and [prig ngo", the 
listener is able to bring to mind an elaborate setting corresponding io his primary default -- 
perhaps the Camelot setting, the Elack Forest setting, or the humble outage depending on the 
idiosyncracies of the frame system of the individual listener. Of course, conflicts will arise from 
difference between (he Story and the default, causing one of two results: either exceptions are 
entered into the evolving data base representing the system's comprehension of the story, ar, if the 
conflict is too radical, the entire default selling is replaced with another. In either case, the result is 
that far more detail than is actually stated in the story is assumed by the listener through this 
mechanism of default knowledge. 

Just as there can be drfau.t knowledge about scares such as SKLing, so too can there be default 
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knowledge about the expEcted sequence of episodes in [he story, where each episode is itself a 
Frame (e.g. a frame for the initial State - a damsel m distress, followed by a sequence Of frames 
describing a Typical rescue operation by a handsome prince and hmdered l>y a wicked witch). This 
■organ nation of frames into state action sequences is essentially the Script idea of Schank's 
discussed earlier, DffftU*t knowledge about expected themes, morals, narrative Structures may also 
be included. We expect, For example, For a Fable to have a moral, for the prince and princess to 
k^E happily ever after in a children's fairytale (though, not in the original primitive folklore). 

The point of this discussion is to articulate this model of comprehension in which the process 
IS not one of literally understanding- the text, but instead is more one in which the text Eluggers 
rich, highly structured knowledge packets Chat supplement the literal Content, provide expectations 
for the remainder of the story and place the story in a'context of relaTed knowledge,, 

Understanding is seen as Essentially a process of evoking and then debugging existing knowledge 
packets 
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3. Issues Jr. Computational Epistemology 

A[ III is point, we have seen how the linguistic theory of ca^ grammars has been extended by 
AI ttf bKDme a fcnnwledge-oriemfti theory of language. We now step back from a purely linguistic 
orientation in order to gain a perspective on the genera] theory of intelligence and knowledge 
embedded in [tits point or view. Specifically., we jhaLi examine: 

(J) the manner in Which frames exhibit a readiness 50 use new, flexible arid 
complex data types; 

(2) the fashion in which frames represent a synthesis of procedural and 
declarative knowledge, 

( 3 ) the sense in which the use of an anthropomorphic metaphor has become a 
technical device m AI theory; 

(i) the trend towards the representation of knowledge in larger, mare structured 
units (which frames excmplLfy); 


{5) a concern for allowing Useful interactions between diverse kinds of 
knowledge; 


{&) a desire for expertly controlled processes, 
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3.1 Representing Declarative Knqwledge 

A cent rat concern of At is the construction of new datatypes and the exploration of their 
capabiJitjes. The frame idea «s still evolving m Al. and jo- le is difficult to give a precise 
definition, Minsky gives the following description of a frame in his original paper D n ;he subject; 

U© can think of a frame as a network of ncdss antf relations. The 
top I eve la" of a frame are fixed, and represent things that are 
Sij-r&LfS teyo dDOLst the supposed situation. The 1 outer levels havg 
many terminals -- "slot*" that tmst he fit led by specific 
instances or -la. a, Each terminal car spue'sty conditions its 
assi tjrtnants must. mast. [The assignments themselves 3 re usually 
smaller "sub-frOnes, " J Sinple conditions are specified by 
markers that might require! £i terminal assignment to be a person, 
ar, object of sufficient val ue h or a pointer to a sub-frane qf a 
■certain, type* Here Couple* conditions can specify relations 
among the things assigned to severar terminals. 

(ttinsky* 11375, p. 212 J 

We we from the above description that the traditional dtclarativa date structures qr object j, 
properties and relations plays an important tok tn the depjnllton or a frame, This aspect of frame 
cheery ha* Jii roots in earlier work in A3 on the representation of knowledge as atcrjbute-viiue 
pairs (e.g. the BASEBALL program {Green, L9GJ)) and ss .semantic tiers {QuLlhah, 19oS). The 
notion of a Semantic Net has already had an important impact on cognitive psychology, cf work 
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by Anderson and Sower (i&'JJJ and Norman (]975>. The additional ideal added by frame theory 
which generalise the notion of a Semantic Net are: 

(]) the nrganiiataon or" These relations, hOT JniO a uniform net, but Into a highly 
structured set of contexts {the frame* themselves); 

(2) the inclusion ftf default* for properties and relation* representing the 
expected values and supplying common senie knowledge about The context 
(possibly not Supplied by the actual text}. and 

(3) the infusion of procedural knowledge which we shall diseu** below. 


3.2 Repregenling Procedural Knowledge 

lr sections 2.2 and 2.J, we saw informally how procedural expertise can b^ associated wnh a 
frame in the farm of a Ft&irse-Keeper, a procedure whose action i* to examine the current context 
for data that can fill the sIpts of the frame. To further clarify the concept, we begin with a 
gedanken example of how an A1 program might emulate a certain aspect of the way a 
mathematician thioks. The example goes right into the domain of the logician and propose* a 
Very different way to cope with the existence of paradoxes *uch as the But'alL-Forle paradox in 
mathematics and such paradoxes in ordinary language as the use of self-reflexive assertions iueh 
as A51 Cretan* are liars'* (ipoxen by a Cretan and meaning- Cretans never tcEl the truth). The 
traditional logicians recourse i* to seek an axidmatuatjon from which no inch contradict I on* can 
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hr deduced. Our theory of whit an working mithemaUnan actually does li to use principles of 
reasoning which could easily lead to paradoxes if they were ;o be applied in that direction. But 
the mathematic Lari knows chtS and refrains from using them In directions known (0 be dangerous. 
It is as if he posted a sentry near dangerous plates in his intellectual terrain . The job of the sentry 
is to issue warnings if any reasoning process comes too dose to the danger spot, [n AE we see as 
our job developing theories in which the concept of such a Sentry can be easily form allied. 

The anthropomorphic idea of jiosting a sentry as typical, we maintain, of the way LIT which 
sensible people have thought intuitively about this kind of problem, The contribution of computer 
science is to show how to translate this intuitive matter into a technical and well defined form. If 
one represents a knowledge system as a set of proportions, the idea of introducing the sentry is 
outrageous, or At best a literary conceit. But if the knowledge system is represented in the first 
place as a set of interactive processes, fhe sentry is merely one more process. Naturally one has ro 
be sure Chat cblS new process is wdt defined. Uut that is a technical problem, The conceptual 
change is that it ts now perfectly natural and coherent to introduce such entities. From this 
perspective, a fundamental contribution of AI to epistemology is made clear: and that is. the 
systematic introduction of active agents into epistemological theory cc?nstructions h so that, fur 
example, an item of knowledge, a concept, or whatever, is seen as an active agent rather chon a 
passive man ipu [stable object. (See Gieif and Hewitt (M for recent work in this direction.) 

The gedankefa example from mathematics seemed to us well suited to illustrate the force of 
this concept m an intuitive way. But there are real applications applied to quite real problems. A 
well known example is Ch aimak's method for dealing with reference problems in child ten's stones. 
To illustrate this, consider the following story fragment from (Charmsk, 1572). 
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Today was Jack's birthday, Penny and Janet went to the stoThey wtre 
& Din S to get presents. Janet decided u get a k,te "Don't do that,' said Penny. 

Jack has a kite. He will make you Cake it back." 

The goal is to construct a themy chat explains how the reader understands Chat "tf" refers to rhe 
ne* kite, not the one Jack already owns. Purely syntactic Cnterii (such as ^signing (be referent of 
”tr to the last mentioned noun) are clearly inadequate, as the result would be to mistakenly 

understand the last sentence of the story as meaning that Jack will make Janet take back the kite 
At already owns, 

Resolving teierencc es a traditional problem of linguistics that cannot foe handled foy purely 
syntactic techniques, Our solution will involve the realisation of our intuitive "frame-keeper - as a 
precise procedural entity, the dewn, This is a production-like entity that is activated by some 
aspect of the state of ihe nmpuntiDHl world within which it is defined. But we will postpone the 

details of this solution until section 3.4 in order to first give some perspective on the underlying A1 
philosophy, 


3.3 Representing Kno wledge in Highly Structured Packets 

The fundamental Frame assumption is the thesis that /rarer j tiMtim* tfiM lire ttonk itt really 
OS ^ as we tAtnk, Mot: situations in which people find themselves have sufficient in common 
with previously encountered situations for the sahem features to be pie-analyzed and stored in a 
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situation specific form. Whenever this happens we ire able ro uie 'pre-digmed", highly particular 
knowledge - in explanation in sharp contrast with the views of intelligence propounded by 
theories who would prefer to believe that our minds always work {and have to worUJ m * 
deductive fashion from very general principles. 

For our km example, it is clear that one qinnOL know that V refers to the hew kite without 
knowledge About the trading habits or our society One could imagine a different world in which 
newly bought objects are never returned to the store, but old ones are. The question we raise here 
is ho-w this knowledge might be represented, stored and made available to the process of 
understanding Charrtiak's story. To see the Spectrum of possible theories consider two extremes: 


An Extreme in the Frame Direction- 


The listener has Stored away in long term memory a frame for birthday parties in winch 
«rtam events and problems are explicitly represented. These include the problem of choosing a 
present with quite explicit reference to the three or four "possible dangers to be thought about 1 * 

including "recipient wont like it and will be disappointed" and "recipient already has one and will 
return, ft", 


Alt Extreme ;n :hc Anti Fra me Jirecnon 

From A Strictly logical point of view it is not necessary to have a pre-constructed party frame 
ih order to assign che proper referent to "it". Et as unnecessary to have been TO a party or have 
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given a present. I; ™ u id be stiffuient tg have stored away for g W « a | use items D f information 
such 4i these about returnjng gift*: 

tit U« has besn Dought at atone y) and t* it in gap* cbndi tianSI 
rftipliee tutors y mil e^chango *• far z) 

[2i s lit is rtpii) and lx haa rot been used]} 
impliti tx is in good condition) + 

Th« advantage of this method {if it IS Workable) Are apparent; less memory space, easier 
updating, modularity, etc. The major disadvantage is that we then have ro fare the question of 
how these two items of information are chosen from fhe presumably very large stock of knowledge 

presumably including many other items rather similar to these. How does the interpretive system 
know which Items to use? 

An even mare serious problem takes the form: how does the system know rhat arr^ appeal to 

knowledge about the world is necessary? Are we to assume that simjile syntactic rules (e g. based tsn 

word order and proximity) are never used? But,(they are sometimes used, what restrains them in 
this case? 

By providing possible answers to thm question, the frame theory shows itself to be much mare 
Ehan a me:ho(i fe£ information retrieval or "hennas search." One such answer ts chat Charn Lab's 
reference problem simply doss not arise in the Sense that the Eirthday-Present-Frame-Keeper wlII 
already have assigned the new kite la a slot for 'returned gifts" before the pronoun is processed! 
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3.4 The D&man Solution to Ihg R efei'oraeg Pro blem 

Charniak * 157? research on constructing a model for compreSiendmg children’! stories 
predates the articulation of the frame idea, but presages important notions of chit concept in 
several ways. In particular,, h.s formal re-aluaLian of a frame was in the form of tnurknowledge 
about a large variety of situations that arise In the context of these stories. The mechanism of his 
program was for the content of sentences to evoke this base knowledge wiLh the following effect: 
demons ("f ram e-keepers" in our terminology) weie created to monitor the possible occurrence in 
later sentences oF likely (but not inevitable) con sequences of the given situation, Thus, Tar our 
Story fragment the birthday knowledge creates expectations about the need for participants of Che 
party to buy presents and the possible consequence of having to return these gifts. Hence, theso 
demons expect the possibility of Jack already possessing the present and the resulting need for 
Janet to return it, where li is known to be Che present, 

It is interesting to note that this demon about present* cannot be applied unless several 
common sense inferences are made. For Sample, the demon might be a procedural representation 
of the following fact 

If a person PI does not want A present X, then 
he tiag request the give.* P2 to return the pr-e&ent. 


For .he demon to be applied, the ancecent, i,e. that PI does not want the present X h must be 
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triggered by an assertion in the databai f being created to represent [He meaning of [be j EO ry. (The 
database may be [nought of as containing [he kernel sentences of the stoty with each word mapped 
into its proper iensc and reference problems disambiguated). Bui (he story does not include 
explicitly all important farts. Look back at the story Some readers Will be surprised to note that 
the teitr itself does not state {a} that the presents bought by penny and Janet were for /nek, (b) 
that the f op bought by janei svas mrended as a present, and (c) that having an object implies [hai 
one does POE want another, All of the above farts are inserted into the database by other demons 
mad« activated by the birthday frame. Thus, Charniak's notion of a demon is particularly 
satisfying because not only do demons handle reference problems, ihey are *1*0 UM d to insert into 

the story database various reasons., motivations and purposes not exphctly stated- 

i 

Ift t:ie currtnt VJ(?W frame theory, this Lnsercion of unsaid information assumes a 
paramount role. From this point of view, it is not accidental that so much is left out of the story. 
Rather the typical situation in comprehension, is to be faced with a set of dues that evoke a nth 
and detailed knowledge structure, the frame, that supplies the unstated details. Naturally, these 
defaults may be inappropriate for some Situations and. in those cases, the text mutt Jupply the 
exceptions. But in general, reference, deduction and comprehension tic made possible only by 
knowing a great deal about the sduahori being discussed. 


ULjrhe Virtue of Particular Knuwladrre 

AI [£ a young science and the ideas that we have discussed are under constant development, 
riaboratian and change. However, tt has been our goal to give the reader a feeling for the kind of 
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enterpriw that AT II rather thin a final itatenant D n its theoretical form. 

A central point has been the fa^ior, in which many kind* of knowledge play a role in 
cognitive ptm In particular we cormdered for the rtf wm « problem the need for Rea] 
World versus Syntactic Knowledge and. more profoundly, Particular versus General Knowledge. 
A tnajar pojnt of cur analysis, \ A of the Knowledge as opposed to Power theory of intelligence, is 

that Particular Knowledge fan Often do the jot? For Which Central Knowledge is traditionally 
thought to be necessary, 
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4, SaiMtnowicdce 

In sections 4 and h. we leave the cooks[ of Frame Theory in order to broaden, our insight 
into the fid! LOn of A1 as a new kind ef Epistemology, To do this, we shall disco is rwo concerns 
which are «ntral to AI (henries but rarely appear in more traditional formula Cions oF knowledge 
and intelligence: the farif is Che forn^aS representation and use of self-knowledge and the second is 
the importance of control structures. 

i 

• ‘ | , ; 

4,1 Kinda of Self-Knowledge 

As we inaJ see, an interest in Sek-Knowledge will not ta*:e uj outilde our concern for 
understanding 1 the comprehension process Indeed, we shall find that this kind of knowledge plays 
an obvious role in allowing the system (whether computer model or individual) to answer questions 
about its own performance as we]] as a role in the less obvjOUs situation of handling special cases 
that an jo in parsing the syntactic structure of the utterance. 

There are many kinds of self-knowledge. Perhaps the simplest Jnhc ability to observe one's 
own behavior, allowing the individual or program to answer elementary questions about Its actions, 
A more skillful use of self-knowledge would involve being able to take account of then 
observations and direct One's problem solving accordingly. Still a more sophisticaied program 
rmght, ID Certain Circumstances, Utilize this self-observation to irarn by debugging Faults m its 
procedures, generalizing its knowledge, or perhaps suitably specialising It to form nvore efficient 
techniques for particular problems. We shall illustrate each of these kinds of self-referential 
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Abui-iei beiow in i£]auon Id the 6-locks VVorld, a favorite iSuni-world for A] programs m which a 
pne-armed robot is asked 10 buald various Structures from blocks placed on a table (figure 3 y. 


4 ,2 The SHRDLU Pr'qgrant and the Blocks World 


Wmograd s SHRDLLf program represented a -four dt fora of design and an important 
benchmark in the realm of natural language comprehension by computer, tn Lhis seetjan, we 
introduce the program briefly and Jn she next two illustrate Several ways in which il manifests 
rudimentary setf-knowledge. 

SHRDLU ts a program that converted such sentences as Tick up the big red block and put it 
in h be box into commands fur a blocks world problem solver The success of its comprehension 
was demonstrated by the program successively carrying our the insiructLcns In a simulated world 
displayed on a cathode ray screen which contained a one-armed robot., a table* a box and a 
col [tenon of cubes and pyramids, 

The progam ts capable of comprehending sentences that reached the degree of complexity 
illustrated by the following examples 

([) Find the block whLch is taller thin the one ynu are holding and put it into 
the box* 


(2) Will you please stack up both of the red blocks and either a green cube or a 
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J^fE ROBOT’S WDItUl 



Figure J 

"ftcfc up the 'big red ilock,' - 
Fron (Hinoflirad lgjj, p, b> , 
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pyramid. 

{3) Does the shoiccit thing- the catltic pyramid*! support supports support 
anything green? 

In SJ-IRDLU, Eh ere are a multitude of different knowledge strictures, For ex amp Ip, the 
system maintains event-history Lists recording each event, why it occurred and what It 
accomplished: numerical Coordinate data structure recoins the dimension and Ideation Of objects; 
a symboi.se database in which assertions about the state of the world, about global knowledge and 
about active procedures are stored; and a Fedor and Katr style marker hierarchy (Kata, mi) 
indicating the relation between words. But, perhaps most Significantly, a major part of the syitems 
knowledge was represented directly as procedures, thereby achieving far more flexibility and pow Er 
than more traditional 'declarative* representations. These different knowledge packets are used in 
many ways; to answer questions, to perform actions, to update its knowledge. 

43 Answering Elementary Why Quest ions 

An elementary example of self-knowledge is found in SHRDLUs capacity lu generate name* 
for assertions in its database, events, noun groups and ocher entities. With these names, the system 
can refer back to these entities, associating rhem with times, places and other information, This 

atlows the system to answer elementary kinds of questions about its behavior, Ftgur* 4 illustrate 
this ability. 
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2J. had you touched any pyroU before you put the green pne* mi the 
little cube? 

YSS F THE GEEIfl 0T3E. 

The Bytien deduces frem Its memory uhat event is beiQJEi ceferrid c&, arc 
“ n Other events to it in tine. The Includes complex Lenses 

tif verb groups ♦ and there ate special facilities for keeping track of the 
vatl&iie farms of irrefcular verbs like "l-.ave." 

24, when did you pick it up? 

WEILS I WAS STACKING UP TEE RED CUBE, A LARGE REC BLOCK ANT. 4 
LARGE GRTRn CUBE. 

A claw Is described by Mining t'nc u*in s^a.1 which was. being worked on. 
15. why? 

Tint system rKnembers limediutely previous sent emcee L* under* tJtsd ques¬ 
tions like this and the (Allowing opes, 

TO ■GET jUL OP IT, 

By keeping track of selected parts of the original sutgoal tree, Lhe 
syaten has snm? unde Fa Landing of Lib asm mpttVM, 

2 b, why did you an Ch-filT 

TO CLEAh OFF THI RED CURE, 

"That 11 rafare C* the action irentinned la the answer, 

27 * why did yam clear off that cufce? 

kero,, "that 11 scan^ the one Just refereed to, 

TO PUT IT ON A 1ARCE CEEHl CUBE, 

IB. way did you da that? 

70 STACK UP THE RED CL’SE, A LAHCE RED BLOCK AM> A [.ARGI GREEN CiJBE, 
29. why die) you do that? 

BECAUSE YOU ASREH ME 70. 

ll remember?. what it did* not haw the request was warded. 

fiUdte 4 

Examples of SEBULU's ability to answer WH¥ quwtltus. 

Uron Oftnograd 19?2, p. 13) T 
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4.4 Sftjhrefergn ce i n Grammatical Systems 

A second level Of sdr-rekrentj.il abiliLty m which [he system can nDC onI}' answer questions, 
but make decisions based on sHkfcnowledge, ii also illustrated by 5URDLU. This u the manner 
in which the intern has access to OK only the current state oF the parse, but a fe» to the past history 
Of the parse. This is illumed by the fashion in which Wind's program parses conjunctions. 

To parse sentences with words like “and", the system invokes a special routine upon detecting 
the conjunction Which notes the last constituent formed and attempts to find a duplicate constituent 
following the conjunction. If unsuccessful, it looks further back in the history of the parse until a 
substructure can be found that ^metrically appears following the conjunction, A similar capability 
is found in Woods Augmented Transition Network approach to parsing (Woods, I970L Elements 
of this approach are discussed m section 5.1 on Controlling Grammatical Compiles tiers. 

It js interesting to note that "and" is not handled simply by augmenting the basic Systemic 
Grammar of SHRDLU. By same standards, fhfci moving outside the formalism of one theoretical 
mechanism would be unaesthetic by being contrary to Occam’s Rajor But experience has shown 
that the management of divers* knowledge packages is the rule rather than the exception in the 
architecture of complex problem solving systems. The obligation is simply one of clearly defining 
the interactions between these knowledge packets. 
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4,5 Qebuggtng <naw/led^e 

I 

Recently, the Rtocki World has served a* a domain for investigating seif knowledge af a 
deeper kind. This has been in the iWm of a system called HACKER (Susimart, 1974) which it 
capable of learning to solve block construction problems of the complexity that SHRDLU can 
handle. (Remember chat for SHRDLU. the problem solving procedures are pre-programmed and 
not learned.) The learning is achieved through HACKER possessing an experts m debugging. 
It is our belief that debugging knowledge is absolutely fundamental to the acquisition and 
operation of any program as complex as human thought and language. 

Although this work was not done specifically as par t of * natural language comprehension 
^stem, it provides the basic understanding to enable surh a system to answer questions about why 
a particular method was used, in a far deeper fashion than SHRDLU's elementary abilities 
allowed, Knowledge Of one's Own problem solving Strategies supports an understanding that not 
only could discuss the chronology of events but could also discuss the underlying plans And their 
evolution through succeeding sages of generalisation and debugging 

To illustrate this, we shall develop an example from Sussman. HACKER initially might be 
compared to a small child interested in comlroctlng block structures, but not very knowledgeable 
about the problems involved. HACKER begins wUfa the ability to place one block on top of 
another {by executing the primitive PUTON) and with an elementary planning capability, A 
typLoal elementary goal is tq construct a Lower from three blocks, expressed formally to the program 
by Che conjunction (COAL (AMD {PUTON A Bj (PUTON & C))). HACKER attempts to solve 
this problem in a linear" fashion by first planning to achieve (PUTON A B), and Then as an 
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independent tccand goal: (PU . ON B C). This produces the Sequence of Mock States illustrated 

below; 


Trace of (ACHIEVE C.ANE3 fQN A 0) (ON B Cl 1 


state 1: A 3 C 


A 

state 2t B C 


1 PUTDN A 9 has jug-t been enEnutep. 


st^t^ 3t A B C 5 The- top pf 6 ha& been cleared as a required 

:pre-requi site qf PU70N 9 C. HAOC£H is 
5 one-handed and trust lift flocks frem the top. 


0 

State A c 


fPUTOM 9 C has just been completed. 


Obviausiy a bug has occurred. The action of dealing the Lap of B as a pt e-requisite to- achieving 
(ON B G) resulting Ln state 3 has undone the first jub‘goa.1 (ON A H) whose purpose was to place 
block A on block B. 


One solution to this difficulty whjcb A1 programs of the past might have attempted (and 
which occasionally children pursue), is to return CO the original state and try some different 
construction strategy, without attempting to learn anything from the current failure to guide the 
nem attempt. HACKER contain^' a new kind of knowledge, namely knowledge dF how to debus . 
This debugging■ knowledge knows that linear wknioris are only ftrw approximation* and that 
sometimes interactions between the sub-goals of a conjunction must be taken into account.. In tbtJ 
particular case, HACKER recognizes that the pre-requisite of one sub-goal CLEARTOP conflicts 
with the purpose of a brother goal, ln such situations, it applies the debugging strategy of seeking 
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a reorder]nl' Of the brother goals such that the goal with the pre-requisite ii achieved first, To (hii 

particular problem, HACKER: 

J) recognizes. 'haL the bug u due to rhegenerai bug-type of "pre-requisite 
clobbers brother goal" 

{ 2 ) applies the debugging strategy pf reordering the goals. 

(Jj edits m longterm memory the procedure Tor constructing towers of three 
blocks to take account of thii ordering requirement. 

H) records a comment to itself for subsequent problems of this kind that states 
that When building block towers pf any size, ON chains should be ordered 
from the butwwn up. 

Thus* HACKER exhibits both expertise about debugging and about repair, couplod with the 
ability to examine its own problem solving goals and. actions so that at is able to apply this 
debugging expense ro its own reasoning. It represents a model of skill acquisition whose 
competence jj based on self-knowledge about its bchav]pr and meia-knowledge about the nature of 
plans and bugs. 

We emphasize the importance pf debugging skill because a vital element of intelligence is 
knowing how to handle a vajx variety of situations. We cannot do this by knowing all about these 
iltuations because each real world situation is different. What we do need it 
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([} knowing how 10 tell winch old situation II sufficient!}' like the present ere, 
and 

(2) knowing ways rg adapt - debug' - the old procedure for handling the old 
problem tp a new procedure that can deal with ;hc new situation. 

' . ■, -. i 

Thus, subjects tike analogy, similarity, metaphor are at the heart of the new formalisms. In the 
older oi AC - ^IfUiititi. logic, and psychology, these were embossing, hard-tfr^plam 

phenomena. In the new, debugging technology, they become concrete, manipulate types of 
knowledge. 

Finally, without a theory of debugging, our image of frame-based comprehension would have 
a fatal flaw. Understanding by using large packet* of previously known information reequircs the 
ability to dtbug these knowledge structures for the particular appl,cation at hand, if the system is 
to be able to deal with novel sttuatigmi, 
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5. Control Structures 

We have seen the way m ^hich A! epistemology is opening new areas of study in its concern 
for self-knowledge and, cr< particular, debugging skill, A second dimension along which AVt 
approach to epistemology is opening new horizon* and fundamentally differs from older 
approaches is its attention la mini lawturt. Indeed the idea of control structure could scarcely 
cKJSt until Dne begins to see knowledge as made up of active agents which need to be controlled. 

5.1 Contro! ling Gramm-aticaf Comput at i on s 

first-order theories such as chose composed of generative grammars are constructed under 
strong assumptions of modularity and independence. These assumptions ire heuristic ones that 
enable the epLStemologscal analysts td get underway, but include an expectation of a subsequent 
need TO study the interactions. Controlling computation by choosing between the relevance of 
alternative chunks of knowledge, deciding when a given approach should be terminated, and 
generally guiding the reasoning process are among the important kinds of interactions which must 
be eventually understood. 

When contikt-frEe grammars were first applied to the problem of comprehending text (rather 
than only generating grammatical sentences}, it was immediately recognized that more structure was 
heeded if the parsing times w, rc to he kept within reasonable bounds. The essential problem „ to 
provide information for choosing whether a given rule should be applied. It is this basic concern 
Chat Characterizes the basic approach of both Augmented Phrase Structured Grammars (Heidern, 
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]«75) and Augmented Transition Nets {Woods, 197Q), The ATIN approach has served as the bails 
Of several important language programs and. we shall dlSCUSS II further below (Set for example 
<SimmQm t l9-'5) in which A. Ns are used both for parsing mio an internal representation and for 
generation of English output from this representation.) 


5.2 A U lamentedtransition Met works 

; ' . . : i ' . . • . 

The Augmented Transition Network formalism far expressing grammars developed by 

Woods was designed to extend the power or Finite State recognition grammars. His first step was 

to generalize the finite state graph to allow recursion,, thus essentially obtaining The power of a 

context-free grammar while still preserving the perspicuity of a network representation. A 

recursive tall Js indicated by the label of an arc being a State name rather than a terminal. This 
I 

directs the recognizer to jump to that state, run until acceptance, and then return ■control to the 
state pointed to by the am Essentially a pushdown store has been provided to keep track af the 
bookkeeping involved in allowing recursion. Figure 5 illustrates a Recursive Transition Net. An 
example of recursion occurs in the interpreter's transition from state S to state Q) by means of the 
recognition of a noon phrase. 

Woods nest step along the dimensions of adding control was to genera hie these recursive 
transition networks further and provide for both a preference ordering on [he arcs exiting' a given 
node and arbitrary predicates which act as filters and must be satisfied for the node to be pursued. 
See figure S for a piece of such an Augmented Transition Nm 
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Figure cj 

n Ssnaple Recursive Transition Nctu*>rk 
Ftoh (Wucds 1B TO* F. 532 ). 
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With the additional augmentation of allow,ng [he system to maintain registers holding 
Various pieces of formation found during the par*, the ATM formalism has the full capability 
Of a transformational grammar, but incorporates these abilities in a form that admits of far more 
efficient parsing algorithms than those previously proposed for transformation grammars. This 
approach has proved successful and new serves as the basis of Woods Lunar system (Woods, 197i) 
and several systems constructed by Simmons (1373; 1975 ), 

Winograd's SHRDLU also addressed the control issue. Woods added control knowledge to a 
state-oriented description, Wmograd Added control knowledge by pursuing an entirely procedural 
approach. Ho represented grammatical knowledge as programs. These programs, being 
procedures, could have conditionals, loops, variables, of recursion. Figure 7 illustrates some 
simplified programs for mgnlung senientes. noun phrases and verb phrases (expressed as 
flowcharts rather than in Winograd's Programmar formalism), 

Woods preferred to preserve some element of declarative representation by dearly naming the 
stales: Wanngrad preferred the freedom of an entirely procedural description. The approaches are 
Computationally equivalent (Warned, afl d both represent the fashion in Which first order 
grammatical Theories can be emended to support efficient computation. 

Of course, most At theories are computationally equivalent just as most programming 
languages can through more or less Saber be made to emulate each other. This is not. however, the 
reat issue. The critical question is the perspicuity of the language or representation for the 
particular applications envisioned. Woods emphasizes the virtues of a net description: Wmograd 
prefers expression directly as programs. 


Golditein, Paperr 


Knowledge, Language and AE 


5] 


PWlME pr«|:rM iSNUHCE 



ucn vx iniin Hr 




Figure 7 

FLflvdiartfi ai elwlilltd PRQCIlAlWAR scanners f* E tisnni ptir*6ee h vert phrase a , 

and usnesfLces. 


Ff-Soi (Ulnagrad 197 Zl ?. -93), 
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5.3 An Issue foj_t.be Future — Deterministic Parsing 

A concern for control structure raises an nbvious quesu&fi for the Futuie: will Lt be possible tg 
organdie grammatical knowledge (Including interactions with other kinds of knowledge) in such a 
way that back-up js eliminated (without recourse to parallelism which is ultimately 'equivalent to 
back-up) at least for those sentences which one intuitively feels do not have a multitude of 
interpretations. 

This goal of eliminating exhaustive search represents, a natural concern of the Computational 
EpiStemologist, First dei ine the basic knowledge Spate. Then analyze Its procedural structure. 
Viewing ths process of finding a .solution as a search branching at choice points, the question 
arises: Flow is this branching controlled?" One possibility is to allow for backup. A second is to 
utlhre paraLielism. A third is to have diagnostic experts that can guide the decision, at choice 
points so that backup is rmniffiiied or perhaps even eliminated, 

i his evolution has occurred most notably in the development of programs capable of 
symbolic integration. An early program by {Slagle, ESS3) had knowledge of a basic set of 
integration formulas but combined this knowledge by means of a search. Recent programs 
following Moses design of SIN (Moses, have a deeper understanding of the nature of 
integration and do not require any Search to rind the integral. We do not necessarily expect 
language programs to evolve to the extreme form of deterministic problem solving that the 
integration programs have reached, but we do expect a Similar kind of evolution in the same 
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direction. 

Indeed, insight Into grammatical knowledge and knowledge representation techniques has 
reached ihe point where one can seriously consider a backup-free approach. The issue is not 
decided, but the suggestions are promising, For example, a backup-free scheme has recently been 
proposed- by Marcus (1975). He calls his approach 'wan and see parsing* 1 , The basic idea if thnt 
backup is made unnecessary if the system is provided with a limited look ahead of only a few 
words {or phrases} and if sufficient constraint is provided by semantic and pragmatic knowledge- 
He is Currently constructing a case-frame based parser following these guidelines. 

Another proposal by Riesbeck {1975} involves a somewhat more radical approach. Riesbeck 
thinks that grammatical knowledge should be thoroughly integraied wjLh semantic and pragmatic 
knowledge to such an extent thac it is no longer reasonable to speak of separate components. This 
represents the eKtremc posirion .along a natural line of evolution of more and more imimate 
interaction between grammatical and other kinds of knowledge, Riesbeck has constructed a 
comprehension system that reflects this philosophy. Words in the sentence trigger "productions” 
(tailed demons by Riesbeck) whose goal is to build a representation of the sentence's meaning 
(Using Schank’i conceptual dependency structures (Schank, IOT. Some- of these productions reflect 
grammatical knowledge such as generating the expectation for a noun following the appearance of 
an article. Others represent semantic and pragmatic information such as expecting a human 
recipient following art appearance of the word “give'- Rebeck proposes that such an organization 
has both conceptual elegance as well aj the ability to pari* deterministically. 

A new conceptual grammar by Martin (1575) also promises To generate perspicuous, backup- 
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free conversion* from sentence* into expressions in his semantic representation. Martin's work 
suggests the possibility that an inicmal repmentation might be closer to the "surface* Structure of 
natural language than most worker* have believed feasible. 

Clearly there a an important interaction between grammatical knowledge and other kinds of 
knowledge. A purely pipeline model in which a grammatical amity*!* is followed by a semantic 
analysis and finally a pragmatic analysis lj untenable. However, it is not yet clear which 
alternative among the following is preferable; {l) structuring the grammatical knowledge as a 
modular expert engaged in a dialog with other expert* or (2} distributing the grammatical 
knowledge with other kinds of knowledge in such a fashion that it is no lunger appropriate to talk 
about separate syntactic, semantic and. pragmatic Components. Tn an} 1 case, it is our expectation that 
in the next few years language systems wjl] be constructed in which backup dues not generally 
occur (except in the ca*e of garden path sentences which are equally confusing to people}. Thts is 
based on the observation that most procedural theories initially require backup, not because of the 
Intrinsic properties oa the problem area, bin rather only became sufficient knowledge to 
disambiguate alternative* ha* not yet been incorporated into the system. Thu knowledge has 
gradually been formaliied (a* our earlier discussion of work by Winograd and Woods showed). 

I 

Consequently, we are evolving towards the Stage where knowledge is available to make the right 
choice* at each decision point. 
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S. The Classical, Tl sp mar-lic an d modern Periods pT HI 

6-1 A Change in P-er&p-ectivg 

In this essay, we have tried to give the flavor pf the new perspective from which At 
approaches traditional problems in linguistics and epistemology. Specific forms of this general 
change in perspective ire relaxed to a number of theoretical issues: 

' Che relative Importance of procedural as apposed, usually, to ""declarative' gr 
"propositional' knowledge; 

‘ ihe shift away from traditional logistical concepts (axiomitna Lions. log ml 
deduction) for the representation and manipulation of knowledge to more 
highly structured and particular representations; 

- liberation from the need for highly uniform transformational method; by 

allowing procedural sentries or "frame-keepers' to populate the knowledge 

I . 

system. 

In each case we note a trend away front a traditional form towards a newer form. In each case 
experience has already shown us that such trends can go too far. and that there is often good 
reason fur using rhe traditional forms. So. looting back over the brief history of At, we see a 
dialectic process with three phases which might bE termed the Classical, Romantic and Modern 
Periods of AI. 

In the earliest work (which might be termed the Ci&iSfcdi Period extending from the mid' 
fifties through the early siJitLei}. knowledge IS represented in very traditional ways: for example as 
subject - predicate propositions Of as important hut relatively non-fundamental extensions af this 
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to property lists. Reasoning is accomplished Through uniform techniques such as search or theorem 
proving. 

Typical prog rami of Ihe Classical Period were often criticized by laymen on line grounds cliat 
"people don't think like that." (Of course, this reaction may be corally wrong, since people do not 
necessarily know what deep processes lie under the cdoscihis layers of thinking.) However, whether 
or rot these objections are right in principle, the programs of what w C shall term the Romantic 
Period {extending' frona the late sixties to early seventies.) do not evoke them nearly to the same 
extent. On the contrary, they sometimes appear to be nothing more than a new language for 
introspective material Striking examples arc the Ivans ANALOGY program (Evans, 19 & 8 ) and 
the Winston learning pTOgram (Winston, IS^O). 

This stage might be justifiably termed the Romantic Period for two reasons. One is the 
extent to which it represented an introspective endeavor, while the second is a notable tendency to 
argue for first one approach and then another as a complete theory of intelligence. The period 
began with an emphasis on network-like systems (QuiIlian, ]3&7> and ended with a concentration on 

Che representation d( knowledge m an entirely procedural form (Hewitt, iflTXfc Wtnograd. 1972). 

" ‘ I ' ■ • ■ : 

Finally, in this the Aferfern Period, we begin to investigate more rigorously the conditions 
under which different kinds of knowledge are appropriate. Frames are extended from an 
essentially declarative data type 10 a procedural sysiem through the incorporation of t rame-keepers. 
Procedures, in order to allow debugging, art embedded in a web of declarative commentary 
Specifying the purposes, requirements, bogs and effects of programs. Procedural knowledge IhuS 
serves as a basis for action while propositional knowledge provides a basis for understanding Che 
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behavior of procedures and especially for knowing how ;o debug or change them. 

S,2 Some Unanswered Technical teaues in Prame Theory 

We have discussed a variety of fundamental epistemological notions - frames procedural 
knowledge, control structure - and have uLliisd these concept to provide a unifying (bread to 
recent AI natural language research, We conclude this section with a collection of questions 
surrounding the frame idea which have yet to be answered and which we feel will provide the 
focus for the next round of research in the field, 

i . - ■ . . 

Learning: 

A theory that makes extensive use of pie-stored knowledge obviously faces a critical question: 
• | 

how is such knowledge obtained? Or, in other words, how does such a theory allow iioveI 
situations CO be comprehended? 

Reference and Reasoning- 

Given that the system knows that feathers can be used to tickle (i-e- could successfully answer 
a question to that effecr), it still remains to describe how indirect reference or reference through 

i . ' - 

description is accomplished Presumably, there are some tickling instruments that we have never 
before considered, bur that we nevertheless are able to understand in that role. How are such 
inferences made? 


Difficulties with Sentries: 
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There are A variety of technical problems that, arise in implementing the ide? of a sentry as a 
demon. These include handling conflicts {i.t situations where more than one demon is triggered 
by the current situation and each has an alternative meaning), forgetting (i.e. when should a 
demon become inactive 1 ), and COfitext-switching (i.e. how are leaving, returning and combining' 
contexts handled). 

Representation: 

Jn many ways, the frame idea has served as A metaphor For us to present various ideas about 
she representstion of knowledge “ both declarative and procedural. However, there are many 
representation question? that must be resolved in formallimg the notion, Tor example, how 
homogeneous is our representation of language to be? Are sentries (demons) to be represented in 
the same form as the basic frames? Are current computer languages such aj Lisp adequate or do 
we require new programming constructs? 

Control. 

Is .knowledge of control implicit or explicit in a frame systemr' Control u explicit when in 
addition to basic facts the system also includes knowledge about When to apply these facts and how 
to resolve ennflicit Control thus becomes just another kind of knowledge to the system. On the 
Qtncr hand, control is implicit through the representation of knowledge as procedures, with 
ordering in the program, conditional* and calls determining the sequence of application Which 

i 

approach LI to be preferred 1 

Level Interactions: 


We have pushed the existence of frame'based knowledge for verbs, nouns, contexts, snd 
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themes. Technical questions arise about how these various levels of knowledge interact Is the 
organization essentially fitfrsj-e^tfcf. with each level looking only at the results of the interpretation 
al the level below it. Or CO aX levels examine the surface linguistic input (perhaps in addition CO 
esaminin^ ra.Ch otherjr .s communication occasionally /itScTsnhicd wjjh higher levels someTimes 
directing the analysis of lower levels and at Other limes being tn turn directed by the results of the 
lower level analysis? 

Thu represents the conclusion of our Theoretical discussion of At as a new style of 
epistemology and its impact on the Understanding of language and comprehension. En the ITCKt 
section* we consider the relevance of these ideas to education. . 


I 
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1 . The Impact of ilrNliceaS Inteljjjqenac. or. Education 

7.1 If Ehe MachiiriEi-s Get S m arter, Wilt the Children Sot Smarter? 

t he relevance of Artificial Intelligence to Education can be discussed along two different 
dimensions corresponding to the narrow and broader views of the subject as "machine intelligence” 
and as ''epistemology'. The former view evokes the prospect of more useful machines. sometimes 
described under the label "Intelligent Computer-Aided Instruction" (Brown, 1975), The oLh er view 
has possible consequences of a much deeper sort, for it can be taken as projecting a total 
restructuring of the content and the theory of education But even if this radical prediction is not 
fulfilled, the cognitive style of Artificial Intelligence has very immediate and realistic suggestions 
about what children should be taughc and how. 

These two Views can be stared in a more dramatic way. The organilers of the HSJMRON5F 
conference on Ten Year Predictions of the impatt of Computers on Education’' (HUMRO, 1975} 
those as the theme of one session: "[f the machines gel smarter, wjll the children get smarter?". 
We think the deeper question a thLi: "Suppose that ri^cnr machines can be constructed. Should 
educators look to the machines themselves as instruments to make kids smart, or should they rather 
look at the principle; which led to the machines becoming more intelligent to sec whether these 
principles could be applied to children?" There it, of course, no need to choose. Our inienfion in 
SO posing the question is to draw attention to the richer, hut almost entirely neglected, second view. 

This paper is not the place for a full discussion of the relevance of Artificial Intelligence to 
c-SUCation. Instead, we shall develop briefly three representative issue*: 
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(]) the impact on education erf !hE development of Comp user-loaded tutors with 
reasonable language comprehension abilities. We than develop this topic 
under the heading IntetUgiM Computer-Aided Imttiumn (iCAE}. 

(2) the deeper question of whether the content of At theories of language can 
have an impact on the very substance of linguistic studies in the schools, not 
just in the mannetyifl which the current material is taught. We have chosen to 
Call this topic Tfie Gfaii Bex Approach Jo Lungti-sgr 

(3) whether the basic goal of AI in seeking unified theories or knowledge can 
loe expected to have a profound impact on the entire educational curriculum, 

with the effect of decreasing the currently accepted division of education into 

■ ' I 

disparate subject areas. We shall discuss this topic under the title "Tho 
Articulate Learner* 

Again we remind the reader that our discussion is intended Quip to give the flavor of a few 
kinds Of interaction that Artificial Intelligence and Education might have. For a discussion of 
relates areas of research in psychology, linguistics and AI. see (Win^rad, 1976}. For a deeper 

discussion of the particular Interactions discussed here, see (Minsky, Pipert, 1S7* Pa pert. ]$72i 
Goldstem, 19'7'i). 
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7,2 Intelligent Computer-Aided Instruction 

The question cited earlier "If the machine* gCL smarter. will the children get smarter?" 
immediately evokes the comment "certainly not automatically". This follows from the deeper 
observation chat the question ii not primarily about machines, but rather about the nature of 
human intelligence and the effectiveness of teaching- To see this, transpose it into the rwo sub- 
questions; 


(I) "Could an idealiied personal human tutor make a kid *martei?" and 

(2; “Could all or some of the functions performed by this tutor be mechanized?" 

Answers to the first question will vary depending on what one thinks about controversial 
issues in the theory of ■(human) intelligence, But there would certainly he less variation in answers 
to the question: 'Could a highly intelligent, empathetac personal tutor enhance the intellectual 
development of children?" (Whether this enhancEment is to be described as increasing the child's 
intelligence or the effectiveness of his use of it j| not germane here.) 

There are very few programs that can lay claim to fulfilling the functions of an intelligent, 
insightful tutor. However, imporiant ground*breaking research has been done by Carbonell (157&) 
and Collins (1575a) working on the SCHOLAR Project, and. by 3iown (]57i) tn his development of 
the SOPHIE program, , he SCHOLAR Project has developed a series of successively more 
sophisticated computer-based tutors for such domains as geography. The project has involved the 
development of natural language comprehension modules, rhe construction of domain-oriented 
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problem solvers based on a semantic net approach CO representation, and, molt notably, the 
procedural representation of tutorial strategies. 

Recent wort by Collins {13751?) has involved a deep Interaction between psychology and 
Artificial Intelligence in IhE form of studying the kinds of reasoning that people typically use to 
handle questions in ep*n domains. iJt. concern mg' topics about which they know a great deal but not 
all of the relevant facts. People arc typically quite good at making plausible inferences in such 
contests, Collins' goal IS to represent such reasoning -strategies in a procedural way and Chen make 
that knowledge available to the SCHOLAR tutor for the purpose of allowing the tutor to help a 
student debug hu own reasoning style. 

Brown (1974) has developed a computer-based laboratory in which a student can learn 
electronic troubles hooting. The program SOPHIE contains an electronics problem solver that 
embodies a genu me understanding of the circuits Involved, a natural language component [hat 
allows English question-answering t Q occur between the student and program, and an 
understanding of Certain kinds of tutoring strategies. We shall not develop the details of Brown's 
program further, except to remark On the following point;: 

(l) the program ts a milestone in Computer-Aided Instruction (CAt) simply because of the 
esc tent to Which it understands its subject matter. Previous CAI programs (and indeed 
most current ones) are notorious for their ignorance The slightest deviation from the 
expected response on the part of the student in his answer results in the program 
erroneously judging the answer as incorrect, Brown's program utilizes a powerful 
simulation of [he circuits Under study M accurately predict the consequences Of various 
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faults, [his kind of domain competence ls essential as a foundation for any CAE tutor, 
and we espect the incorporation of A[ problem solvers Lnto CAl programs to become 
more and rooie common in the coming years. 

(2) the tutoring strategies of SOPHIE are concerned with the nature of reasoning 
involved in building up a model of a problem from incomplete evidence This is a 
fund a mental kind of human reasoning. ]t addresses the basic issue of whit kinds of 
questions a student should ask in attempting to understand a nr-* subject, device or 
electronic circuit. SOPHIE is able to judge whether a measurement requested by the 
student (as part of the process of tracking down a fault in the Circuit} is a reasonable 
question on the basis of what the student already knows. If not, ie, if the answer to the 
question is already entailed by information previously Obtained by the student, then the 
program draws the Student's attention to the redundancy and discusses the information 
Which the Student knew but filled LO consider. 

This Study dF human cognitive! processes through the formalisms developed by Artificial 
Intelligence and the incorporation of that understanding into CAl programs IS only in the 
formative Stages. But it is our hope that in the coming years, the concept of personal computer- 
based tutors available- to any student at any ejitie will become more and more of a reality 

7.3 The Glass Box Approach, to Langi;a ae _Sludi6& 


in the previous section, we saw the application of Artificial Intelligence research on language 
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comprehension and problem solving to she desLgn of intelligent CAl programs. En this section, we 
develop a more radical uppUatiofi. 

i ' ... , 

The ICAl tutors, no matter how intelligent, are eiHMtally btuk boxei from the student's 
perspective. The possibihry exists that the Cnncepts developed to design those programs might 
themselves be worthy object* of study by the students. We propose calling this 4 gtili box 
approach,. Unlike the "black boa* approach, the point here is precisely to encourage the young 
students jd look inside the box, to ask how ir works and to gam insight into then own language by 
so doing. In using the phrase "Class Box H , wo follow Peele’s (lOT-t) discussion of APL programs 
whose structure is available to the student to be studied and understood. 

1 - , , ’ + 

Art example Of the glass box approach ii illustrated by suggesting a new kind of Language 

■ ’ • i |- - 

Laboratory. We en vision a computer-based environment in Which students would have the 
opportunity to explore the structure of language by designing, programming and testing various 
kinds of language comprehension programs. 

The virtues of such a laboratory approach lie in the active nature of the studeht’s 
involvement, the possibility of the student developing his ideas in a personal way not rigidly 
limited by the teacher’s approach (but reasonably circumscribed by the goals of designing a 
successful program), and the exposure to new computational theories of language that are in many 
ways superior to traditional grammars as conceptual frameworks for the study -of language at 
school 

I ■ 

As an example of the power of these new approaches to language, we mention the possibility 
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Ehai they hold out for a unified approach to the study of knowledge. Tins was illu strand earlier 
hy OUr diiCUiSlon Of Frame I hcory in which I he same basic knowledge-representation techniques 
we:e seen to appiy to grammatical knowledge as well as othEr kinds of knowledge conerrned with 
themes, context* and semantic meanings. The exciting possib-iltry exists of dimintshing the 
compartments luation of school knowledge. Linguistic knowledge becomes less of a "different* 
thing, apart from other knowledge. 

Out senes of roles fur language understanding machines has moved progressively away from 
a focus on the machine to a focus on the linguisric Ideas which make the machine possible. We 
conclude this essay by generahitng this approach beyond the linguissic domain. 

i- I t‘ : . t _■ _ 

7.4 The Articulate Learner 

• ‘ • • a : . 

j . . ' 

in this secuon,. we develop further the notion of a unified approach to educational: curricula 

I - . 

by introducing the notion of Art iculs.it Learning. Osir discussion is motivated by the question of 

j ■ 

Why students who find language Studies in school challenging and who do well in them often find 
.ne mathematics curriculum a confusing and impossible mate. A traditional answer is tliat there it 
such a thing as quantitative mathematical abilities and qualitative language-oriented abilities. 
Such a belief is mirrored in ail of the Achievement Tests that Students arc asked to take In which 
there are clearly demarcated language and mathematics section*. 

We should like to raise the possibility chat the traditional mathematics curriculum of grades l- 
12 lacks certain dimensions which make it understandably obscure compared to language studies. 
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In parLiCul.u, ni baste falling Li in the exiteme difiiculty which u presents for a student or teacher 
who Attempts to be articulate about the problem solving process. 

Iti language studies* a teacher can discuss the many dimensions of a story written by a student: 
Its grammar, its theme, its narrative structure, "he plot, the setting. In contrast, what can a teacher 
discuss about the failure of a Student to solve an arithmetic problem or an integral? for example, 
we have seen a child go to the b|jdboard and solve an arithmetic problem tn the following way: 

35 
+■ 35 

510 

The teacher involved had little co say besides observing that the answer was wrong. We 
believe 1 that the earlier discussion of At and Epistemology has some very definite suggestions for 
altering the mathematics curricula to allow the student and the teacher to be far more articulate 
about Che knowledge and problem solving involved. 

For the above example, we imagine an; environment sn which the first kind of knowledge that 
a student is exposed to is the structure of programs and the related ideas of planning and 
debugging. Skill in design mg procedures lies at the heart of achieving Competence in mathematics 
fand in writing essays as well), Given access to a Vocabulary for programs, plans arvd bugs* we 
believe that student and ifachtr could be articulate about describing the particular algorithm used 
by the Student in reaching the "EKT answer, in identiFying the bugs, and in debugging the addition 
program £o yield a correct result. For the above example, we imagine A discussion similar to the 
earlier description of the HACK£R progra til's reasoning regarding iLS problem solving in the 
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Blocks World. The Student had mitred an addition procedure for single digits. Hiigrat in this 
proto lens is to apply that procedure as a [□ the more complex goal of adding multi-digit 

numbers. His solution indicates that he followed the reasonable Itn t<iv plan of assuming that |h? 
sum of multrdigit numbers could be achieved by adding the columns indt ptndetitly. Linear 
planning commits the problem solver to have some method qF putting together the solutions of the 
sub-problems which were arrived at independently The answer of HSIO - indicates that our student 
utilized the simple artifice uf concatenating the two partial solutions. The tag,, of course, was in 
assuming the complete jndependencc of the column additions. The cifiyggtng is accomplished by 
taking account of this interaction through ordering eta columnar additions rlght-toHeft and 
UlltiJing a "carry" dais. itractWTi. 

The rea.der may find a vocabulary involving such terms as linearity, bug, debugging, 
independence and data structure confusing; and, must certainly believe that such a vocabulary is 
hardly useable with an elementary school student. This is true, of course, given the current 
curriculum. However, our proposal is not Simply to add this vocabulary to the traditional; 
curriculum. It is more far reaching. The proposal is to redesign the curriculum to allow 
introduction of this and related concepts at an early stage. 

This is being done in a preliminary way at MXT.'s LOCO project (papert, 1972) in which 
new comp ufa Eton-based approaches to mathematics, science and music it re currently being 
developed. As an example, elementary school students are introduced to the above set of 
programming concepts and to traditional mathematical concepts in the context of programming a 
computer display to draw pictures of their choice (figure 5 ), 


Kft&wlcdg-c, Language unci A] 


GoldUtejn, Pa pert 


49 



Figure 0 

Typical Pic cures Ueawti by Children 
Fr<itn (Ccldattir. 1974. Pp. 9 and 21} ^ 
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t he children CfrrtstnjcE (he programs themselves in LOGO, k programming language 
expressly denned for students. A successful LOGO experience involves nor only (he student 
being sdf'motivated to -construct such programs (because of (he enjoyment of creating pictures), 
but also involves discussions between student and teacher, between Student and student, and 
between the student, and himself, in which the student becomes verbal About his reasoning- and 
learns to use a vocabulary describing hu plans, his programs, and, most importantly, his bugs, 

The design of new educational environments in which the student is introduced to the 
concepts that AI has I aund essential to the construction of intelligent programs, is only in a 
beginning state. But our experience with children in teaching them Turtle Geometry, Computer- 
based Physics, and Computer-based Cnmposing leads us tn believe that the approach is promising 
and potentially revolutionary, Furthermore, it seems to lead to a situ&uon where students can be 
articulate about their own problem solving, regardless of whether it is in the mathematical or 
linguistic domain, 

i 

7 r 5 Episl&raology anj Ed uc all on 

The role of a i now ledge-oriented A] approach to the study of language comprehension has 
its counterparts in ail areas of cognitive science, and each of these carries impliesTlons for 
education. 


Wc have argued again and again that a major thrust of AI research has been to find 
representations of knowledge which lend themselves tc effective integration into larger knowledge 
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systems. The analogous problem for education is cbvipui; Are the representations of knowledge 
which have proven themselves effective in Augmenting machine intelligence also of value in 
augmenting human intelligence? And, if nett, does- a similar inquiry into the forms of knowledge 
suitable for human thinking yield a different set of modes of represen La Lion? Fuihermor-e, the 
epistemological paradigm for the study of learning asks questions like: What knowledge docs a 
good learner Use in a given situation? What knowledge would make learning a simpler process 
than ic appears tg an observer who does not know this knowledge is being used, or to a learner 
who lacks it? 

Very little attention has been paid to problems of this sort. Hut signs of a rich area of the 
utmost importance seem to be present in the results of such preliminary investigations as the 
construction of Intelligent Compute! Aided Instruction programs containing procedural models of 
human inference and learning, the design of new approaches to ifArrting based on Debugging 
Theory, and the design of new curricula such as Turtle Geometry based on a notion of the 
Articulate Learner. 

It ii our belief that future explorations into the nature of language, of various subject 
domairtE and of the elements of intelligence itself " if pursued from an epistemological perspective 
grounded in the computational metaphor ™ will provide the foundation for -deeper theories of the 
nature of language and of the nature of learning Furthermore, we expect such theories to have 
practical eon sequences in the form of programs That comprehend language and in the design af 
new kinds of computer-based educational environments. 
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Epilogue -- wonbro^srai-cs about Edua ab iljfy and lnnater.es-s 

i 

The gericraE issues about the theory of education disaiaed in section 7 cut across a number of 
acute current controversies about the relations between linguistic abilities in the developing child 
and other cognitive functions including learning. Different approaches to linguistics and to 
cognitive theory appear to lead theorists to very different opinion* on questions such as the 
domain-ipeciftilly of language skills; then relation to particular brain structures; and the extent of 
th*li- mnaienfSj, For example, one sees a very sharp opposition between the Chomskisn and 
Piagetian Sines of thinking (Chomsky, 3975). For the former, there is a definite Ifn^uisdc 
competence which 1* emcmely specific to language (L.e. has very little overlap with Other cognitive 
abilities) and which is drtcrmjned by Innate biological structures. Adherents of the Piagctiart 
school, on the other hand, tend to see linguistic competence as sharing major cognitive processes 
‘with other intellectual domains and io be explicable more In ’Cims of a ps-ycho-developmenral 
process. 

Any such brief summary of complex theories necessarily over-simplifies the respective 
positions. Nevertheless, the oppositions, though more modulated, are real and of fundamental and 
urgent importance. We see w-Qik in Artificial intelligence as having a very direct hearing on this 
debate. This does not mean that we are able to waive rhe issues. Rather we believe chat 
c&nptttitfUrhitt tt] 0 (ieii provide a theoretical overview necessary even for a clear statement oT the 
issues- This belief is reinforced by the increasingly rapid spread of computational ideas into the 
work of psychologies!* and even philosophers whose views fall on both sides of these controversies 
Far example, sec (Factor, 1975} for the computational formulation of an extreme "nativist" position 
by a psychologist who disagrees fundamentally with much of the ’"m a in-line' Ai trends. See 
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{Norman, 1975; Bobrow, 1975) for the views of psych blog tlLs sympathetic to the kind of At theory 
treated in tbJi paper. 

In conclusion, though (he debates, about intelligence and cognition rage both in and out of AT„ 
and the reader may have objected to many of Che specific Observations that we have made 
regarding- frame theory and ocher issues, we believe LhaC there already has been Lhis fundamental 
and pro Found change; that the language of computation hli become the proper dialect for 
discussing rhe banc issues of both psychology and education, 
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